CDS
Accession Number | TCMCG046C31450 |
gbkey | CDS |
Protein Id | XP_008809666.1 |
Location | complement(join(150651..150881,150955..151668,152201..152497,155656..155713,157430..157573,157708..157964,164909..165115,169263..169436,178588..178655,183459..183517,188278..188353,188459..188564,189737..189846,192417..192581,200225..200310,201483..201547,204259..204306,204929..205048,207339..207414,209188..209285,209379..209523,213321..213433)) |
Gene | LOC103721294 |
GeneID | 103721294 |
Organism | Phoenix dactylifera |
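The Location field above uses the GenBank/INSDC feature syntax: `join(...)` lists the exon intervals as 1-based inclusive coordinates, and `complement(...)` places the feature on the reverse strand. As a rough sanity check, the following sketch sums the exon lengths and compares the result with the protein length reported in this record. The helper names `parse_location` and `spliced_length` are made up for illustration, and the regex only handles simple locations like this one (no fuzzy `<`/`>` boundaries or remote accessions).

```python
import re

# Location string copied from the record above.
LOCATION = (
    "complement(join(150651..150881,150955..151668,152201..152497,"
    "155656..155713,157430..157573,157708..157964,164909..165115,"
    "169263..169436,178588..178655,183459..183517,188278..188353,"
    "188459..188564,189737..189846,192417..192581,200225..200310,"
    "201483..201547,204259..204306,204929..205048,207339..207414,"
    "209188..209285,209379..209523,213321..213433))"
)


def parse_location(location):
    """Return (strand, spans) for a simple complement/join location.

    Hypothetical helper: strand is -1 for complement(...), otherwise +1;
    spans is a list of (start, end) pairs, 1-based and inclusive.
    """
    strand = -1 if location.startswith("complement(") else 1
    spans = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, spans


def spliced_length(spans):
    """Total length in nucleotides of the joined intervals."""
    return sum(end - start + 1 for start, end in spans)


strand, spans = parse_location(LOCATION)
cds_length = spliced_length(spans)
codons, remainder = divmod(cds_length, 3)

print(strand)        # -1 (reverse strand)
print(len(spans))    # 22 exon intervals
print(cds_length)    # 3417 nt
print(codons - 1)    # 1138 residues once the stop codon is excluded
print(remainder)     # 0, so the CDS is a whole number of codons
```

Because the feature lies on the complement strand, the interval listed last (213321..213433) holds the 5' end of the coding sequence.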
Protein
Length | 1138 aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA249070 |
db_source | XM_008811444.3 |
Definition | DNA mismatch repair protein MSH1, mitochondrial isoform X1 [Phoenix dactylifera] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGCACCGTGTGGTAACCAGCTCCCTCGTGGCGTCCTCACCTCGCTGGCTCTCACTCGTGGGTTTTCTCCGATCTTCCACCATCCGGAGGTTCTACAAAACGCCGTTTCCAACATGGTGTTGCAAGCTTGTAGAGAGAAAATATTGTTCTAACTCACATGAAATTCTGGTTGGAGTGCCTAAAGCTTCTAGAAGATTAAAACAATCAAAAATTCTTTATGAGGTAGACAATCAGTCACACATTTTGTGGTGGAAAGAGAAAATGCAGATGTGCAAAAAGCCTTCTTCGGTTCAGCTGATTAAGAGGCTTACATATACAAATTTATTAGGATTGGATGTTAGCCTGAGAAATGGAAGCTTAAAAGAAGGAACTCTCAACATGGAGTTATTGCAATTTAAATCGAGGTTTCCCCGTGAAGTTTTACTATGTAGAGTTGGAGATTTCTATGAAGCAATTGGATTTGATGCTTGTGTTCTTGTTGAGCATGCTGGTTTAAATCCTTTTGGGGGGTTGCGGTCTGATAGTATTCCAAGGGCTGGCTGCCCTGTTGTGAACTTGCGCCAAACATTGGATGACTTGACTCGAAACGGGTTTTCTGTTTGCATAGTTGAGGAGGTCCAGGGCCCAACCCAGGCTCGTTCTCGTAAAGATCGATTTATATCTGGCCATGCACATCCAGGAAGCCCTTATGTATTTGGGCTTGCTGGGGTTGACCATGATGTTGAGTTTCCTGATCCAATGCCTGTAGTTGGGATCTCACGTTCTGCAAAAGGATATTGCATGGTCTCGGTCCTAGAAACCATGAAAACATTTTCATCAGAAGATGGCCTTACAGAAGAAGCAATAGTTACCAAGCTCCGCACGTGCCGGTATCACCATTTATATCTGCACACTTCTTTGAGACAAAATTCTTCAGGTACTTCTCGTTGGGGAGAATTTGGTGAGGGGGGGCTTTTGTGGGGAGAATGTAATGGAAAGCCCTTTGAATGGTTTCATGGTGATCCTGTTGAAGAGCTTCTGTGCAAGGTAAGAGAGATATATGGTGTTGACCAAGAAACCACATTTCGAAATGTTACTGTATATTCAGAGAGAAGGCCTCAACCTTTGTATCTTGGAACTGCAACTCAAATAGGAGTCATACCAACTGAGGGAATACCTAGCTTGTTGAAGGTTTTGCTTCCTGCAAACTGTGTTGGCCTTCCAATATTGTATATTCGAGATCTTCTTCTTAATCCTCCCACTTATGAGATTGCTTCGGCAATTCAAGAGGCATGCAGGCTTATGAGCAATGTGACTTGTTCAATCCCTGAGTTTACTTGCATATCAGCACCAAAGCTTGTGAAATTGCTCGAGTCGAAGGAGGCTAATCATGTAGAGTTCTGTAGAATAAAGAATGTAGTTGATGAAATTCTGCAGATGGATAAAATCACTGAGCTTTCTACAATCCTACGCATACTGTTGGAACCTACTTGGGTAGCAACTGGACTGAAAGTTGAACATGATAGACTGGTGAATGAATGCAGTTTGGTTTCACAAAGGATAGGTGAAATAATCTCCTTGAGTGGTGAAAGTGATCAAGAAATAAATTCATTCGAATTCATTCCTAGAGAGTTCTTTGAGGATATGGAATCATCATGGAGAGGCCGTGTGAAGAGGATCCATGCAGAGGAGGCATTTGCAGAAGTGGAGAGGGCTGCCAAGGCCTTATCTGTTGCAGTCATGGAAGATTTTTTTCCAATTGTTTCAAGAGTGAAGTCTGTCGTCTCTCCTCTTGGAGGTCCAAAGGGTGAAATATGTTATGCAAGAGAGCATGAAGCTGTTTGGTTTAAAGGAAAGCGTTTCATGCCAGCTGTGTGGGCTAACACCCCTGGGGAAGAACAAATCAAGAAACTCAGACATGCTACGGATTCAAAAGGGAGAAAGGTTGGAGAGGAATGGTTTACCACAATAAAAGTGGAGGATGCTCTAAACAGGTATCATGAAGCCAGTGATAAA
GCCAAGAACAAAGTTTTGGAGTTATTAAGAGGACTTTCTGGTGAATTGCAGACAAAAATTAACATTCTTGTTTACTCTTCCATGTTGCTTGTAATAGCGAAGGCACTTTTTGGTCATGTTAGTGAAGGCCGAAGAAGGGAATGGGTGTTTACTAAGCTCAAGGAATTTCAGAGTCCTGAGGATAAGTCAGCAGGAAATATTAACATAATGGAGTTATCAGGATTATCTCCTTATTGGTTTGATGTTGCGCAAGGCAATGCCATACAGAACACTGTTAAAATGCACTCACTATTCCTTCTGACTGGGCCAAATGGTGGTGGTAAATCTAGTTTGCTTCGGTCAATTTGTGCTGCTGCATTGCTTGGAATTTGTGGGCTTATGGTGCCTGCTGAGTCAGCTGTCATTCCTCATTTTGATTCTGTTATGCTGCACATGAAAGCTTATGATAGTCCTGCTGATGGGAAAAGTTCATTTCAGATTGAGATGTCGGAAATGCGCTCCGTAATCACTAGAGCTACCCGAAGGAGCTTAGTTCTTGTGGATGAAATCTGTAGAGGCACAGAAACTGCAAAAGGAACCTGTATTGCTGGTAGCTTTGTTGAGATGCTTGATTGCACTGGCTGCCTGGGCATTGTATCAACCCATTTGCATGGCATTTTCGACTTGCCTTTAGCCACAAAAAATACTGTCCACAAAGCAATGGGAACAGAGGTTGCAGATGGCCGCATAAGACCAACATGGAAGTTGATAGATGGAGTCTGTAGAGAGAGTCTTGCCTTTGAAACTGCCCAGAAGGAAGGCATTCCTGAAAAAATCATCCAAAGAGCTGAAGAGCTATACCTTTCAATGAATGTAACTGATGTACACATTTCTCCAAATTCTACAAAAGCTGAGCATTTCAATGCAAAGTTCTATGCAAGTGGTCTTGGTGAAATCAGTGATTCTTCGAGGACTAGTTTAGATTTTCTTCCTTTTGGCAGCTTGGAACTATTACAGAAGGAAGTCGAGAGTGCTGTTACCATAATCTGCCAGAAGAAGTTGTTAGAGCTTTACAAGAAGAAGAGCATATCTGAGCTTGCAGAGGTGATGTGTGTTGTAGTAGGTGCTAGGGAGCAGCCTCCTCCCTCAACTGTGGGCACTTCCAGCATCTATGTACTCTTCAGACCTGACAAGAAATTATATGTTGGACAGACGGATGACCTAGTGGGCCGCGTTCGTGCTCATCGTTCCAAGGAAGGCATGCAAAATGCGGAGTTCCTATATGTTGTAGTACCAGGAAAGAGCATTGCAAGTCAGCTTGAGACTCTTCTCATCAACGAACTTCCCCTTCGAGGCTTCAGGCTCGTCAACAAAGCTGACGGTAAGCATCGTAATTTCGGCACATCTAGACTCCCCAAGGAACCTGTTAAGTTGCACCAATGA |
Protein: MHRVVTSSLVASSPRWLSLVGFLRSSTIRRFYKTPFPTWCCKLVERKYCSNSHEILVGVPKASRRLKQSKILYEVDNQSHILWWKEKMQMCKKPSSVQLIKRLTYTNLLGLDVSLRNGSLKEGTLNMELLQFKSRFPREVLLCRVGDFYEAIGFDACVLVEHAGLNPFGGLRSDSIPRAGCPVVNLRQTLDDLTRNGFSVCIVEEVQGPTQARSRKDRFISGHAHPGSPYVFGLAGVDHDVEFPDPMPVVGISRSAKGYCMVSVLETMKTFSSEDGLTEEAIVTKLRTCRYHHLYLHTSLRQNSSGTSRWGEFGEGGLLWGECNGKPFEWFHGDPVEELLCKVREIYGVDQETTFRNVTVYSERRPQPLYLGTATQIGVIPTEGIPSLLKVLLPANCVGLPILYIRDLLLNPPTYEIASAIQEACRLMSNVTCSIPEFTCISAPKLVKLLESKEANHVEFCRIKNVVDEILQMDKITELSTILRILLEPTWVATGLKVEHDRLVNECSLVSQRIGEIISLSGESDQEINSFEFIPREFFEDMESSWRGRVKRIHAEEAFAEVERAAKALSVAVMEDFFPIVSRVKSVVSPLGGPKGEICYAREHEAVWFKGKRFMPAVWANTPGEEQIKKLRHATDSKGRKVGEEWFTTIKVEDALNRYHEASDKAKNKVLELLRGLSGELQTKINILVYSSMLLVIAKALFGHVSEGRRREWVFTKLKEFQSPEDKSAGNINIMELSGLSPYWFDVAQGNAIQNTVKMHSLFLLTGPNGGGKSSLLRSICAAALLGICGLMVPAESAVIPHFDSVMLHMKAYDSPADGKSSFQIEMSEMRSVITRATRRSLVLVDEICRGTETAKGTCIAGSFVEMLDCTGCLGIVSTHLHGIFDLPLATKNTVHKAMGTEVADGRIRPTWKLIDGVCRESLAFETAQKEGIPEKIIQRAEELYLSMNVTDVHISPNSTKAEHFNAKFYASGLGEISDSSRTSLDFLPFGSLELLQKEVESAVTIICQKKLLELYKKKSISELAEVMCVVVGAREQPPPSTVGTSSIYVLFRPDKKLYVGQTDDLVGRVRAHRSKEGMQNAEFLYVVVPGKSIASQLETLLINELPLRGFRLVNKADGKHRNFGTSRLPKEPVKLHQ |
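The CDS and Protein entries above can be cross-checked by translating the nucleotide sequence. The sketch below assumes the standard genetic code (NCBI translation table 1); the record itself does not state a translation table, so that is an assumption. The function name `translate` is illustrative only. For brevity it is applied to the first 18 and last 12 nucleotides of the CDS rather than the full sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 nt of the CDS above; matches the start of the Protein entry.
print(translate("ATGCACCGTGTGGTAACC"))  # MHRVVT

# Last 12 nt of the CDS above, ending in the TGA stop codon.
print(translate("TTGCACCAATGA"))  # LHQ
```

Running the same function over the full CDS (with the line break between its two parts removed) should give a 1138-residue product if the two entries are consistent.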